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@ A method for producing in vivo stable 
single-stranded DMAs in eucaryotic cells. The 
DNAs are multicopy single-stranded DNA 
(nrtsDNA) stmctures constituted by an RNA and 
a DNA portion. The group of genes (retrons) 
producing said coupled RNA and DNA portions 
of the msDNAs and the gene encoding reverse 
transcriptase (RT). The transformed eucaryotes 
harboring these retrons. The new msDNAs 
which are encoded by the new retrons. UtOities 
are disclosed. 
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HELD OF THE INVENTION 

The invention concerns the field of recombinant DNA. More particularly, the invention relates to an in vivo 
method of synthesis of stable single-stranded cDUA in eucaryotic cells by means of a bacterial retroelement 
5 called a retron. The invention also relates to new eucaryotic vectors carrying the necessary elements to pro- 
duce the single-stranded DNA-RNA hybrid structures. Moreover, the invention relates to transfected eucar- 
yotes, e. g. , yeast, plant cells and mammalian cells. Uses are described for the new products. 

BACKGROUND 

10 

Gram-negative bacteria such as Mvxococcus xanthus. Stigmatella aurantiaca and Escherichia ooii have 
been found to contain a retroelement called a retron. In TIBS . 16, 18-21 (1991a). the authors report on a pe- 
culiar type of sattelite DNA, named multicopy single-stranded DNA(msDh4A). These molecules are character- 
ized by a structure which comprises a single-stranded DMA branching out of an internal guanosine residue of 

15 a single stranded RNA molecule by a unique 2', 5'-phosphodiester linkage. These molecules are thus single- 
stranded DNA-RNA hybrids. Reverse transcriptase is required for the synthesis of these msDNAs. In Ann. Rev. 
Microbiol.. 45. 163-186 (1991b). the authors present a comprehensive review on msDNAs. Also see msDNA 
in Bacteria. Lampson et al^ Progress in Nucleic Acid Researoh and Molecular Biologv . 60, 1-24. . 

The production of single-stranded cDNA by reverse transcriptase as a template is an obligatory step for 

20 RT -mediated transcription of retroelements. See Retroelements . See Weiner et al., Ann. Rev. Biochem. . 55, 
631-661 (1986) for Review. This includes integration of retroviruses Into mammalian genomes, productton of 
infectious retroviruses from pro-viruses integrated into genomes, rotrotransposition of retroelements, and for- 
mation of pseudo genes in eucaryotic cells . 

However, single-stranded cDNAs produred in yiyo by RT have never been directly detected, probably be- 

25 cause of their instability. 

While the production of msDNAs in bacteria has been a most significant development, the in vivo produc- 
tion of single-stranded DNAs in eucaryotic cells, e^ yeast or higher eucaryotic cells like plant and nnammalian 
cells, is of even greater interest. Eucaryotes have well-known advantages over procaryotes for producing target 
molecules. There is an important need to produce stable single-stranded DNA in a sufficient yield for numerous 

30 practical uses in research and in industry. This invention has made an important contribution in that respect 
in produdng single-stranded RNA-DNA structures which are detectible, stable and useful. 

SUMMARY OF THE INVENTION 

35 In accordance with the invention, afundamental finding has been made. It has been discovered that single- 
stranded DNAs which are stable can be produced in vivo in eucaryotic cells. 

Briefly described, the invention provides a method (or process) for producing in vivo stable, single-strand- 
ed DNAs in eucaryotic cells like yeasts or plant cells or mammalian cells. The method of the invention produces 
a single-stranded cDNA by means of a retroelement called a retron. The single-stranded DNA is produced as 

40 an integral part of a branched RNA-linked multicopy single-stranded DNA(m3DNA) structure. These structures 
are stable, Le^ detectible after production and isolation in spite of the fact that they are constituted of RNA 
and DNA. both single-stranded. The method of the invention also provides such msDNAs which contain foreign 
DNA and RNA fragments in the DNA and RNA portions, respectively, of the RNA-DNA structure. Though dif- 
ferent from the known bacterial msDNAs, these molecules are designated as msDNAs or "nnodrfied" msDNAs. 

45 because they have the characteristics and unique features of msDNAs as described herein. 

The invention also provides retrons. Retrons are genetic elements which contain the coding region msr 
for the msRNAand msd forthe msdDNAof the msDNA molecule, respectively, and the gene for reverse tran- 
scriptase (RT). The retrons which are new in accordance with the inventton, have sequences which are different 
from known bacterial retrons in that the non-coding region has been shortened, specifically the region between 

50 the transcripttonal initiation site of the selected promoter and the initiation codon of the RT gene. 

The invention also provides retrons which are new by virtue of the fact that, unlike known bacterial retrons, 
the RT gene is positioned upstream of the msr-msd region, in reverse relationship of that in bacterial retrons. 
These new retrons produce greater yields of msDNAs. 

The invention further provides new types of msDNAs which are new by virtue of having been produced 

55 by the novel retrons. These msDNAs contain a foreign DNA fragment in their DNA portion, for instance, a single- 
stranded fragment complementary to the mRNAof a particular target gene (antisense DNA) and thus, may 
be valuable tools to inhibit or change the expression of undesirable proteins. Similarly, msDNAcan also contain 
a foreign RNA fragment. 
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Further novel embodiments of the invention are transformed eucaryotic hosts with retrons which have 
been identified from bacterial sources. Also new are eucaryotic hosts transfected with the new vectors dis- 
cussed above. Various uses for the new single-stranded RNA-DNA structures are described. 

5 DEPOSIT OF GENETIC MATERIAL 

Plasmid YEp521-M1 has been deposited with the American Type Culture Collection (ATCC) under Acces- 
sion No. 74092. 

Plasmid YEp521-M4 has been deposited with the ATCC under Accession No. 74093. 
10 Plasmid YEp521-M5 has been deposited with the ATCC under Accession No. 74094. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 illustrates the biosynthetic pathway of msDNA synthesis. 
15 FIG. 2A shows the structure of the typical bacterial msDNAs. FIG. 2B shows the structure of msDNA- 
Ye117. 

FIG. 3 shows the arrangement of genes in the retroelement responsible for the production of msDNAs. 
FIG. 4 shows a comparison of the domain structurBS of various bacterial RTs. 
FIG. 5 shows plasmids YEp51 and YEp52. 
20 FIG. 6 shows the restriction map of the 11 .6-kb Eco R1 fragment 

FIG. 7 shows a diagrammatic representation of plasmid PC1-1BPv4, YEp521-M1. YEp521-M2, YEp521- 
M3, YEp521-M4 and YEP521-M5. 

FIG. 8A shows bands a and b of a sequence polyacrylamide get for the reduction of msDNA-Ec67 and 
FIG. 8B shows a schematic representation of extension of the 3' end of msDNAby AMV-RT and RNase A treat- 
25 ment 

FIG. 9 shows Southern blot hybridization of msDNA-Ec67 produced In S. cerevisiae . 

FIG. 10 shows a diagramn^tic representation of plasmid YEp521-M5. Darkened region in the retron rep- 
resents 50-bp antisense DNA for cdc28 (cloned into the Xho l site) inserted into the msd region of retron Ec67. 
Also shown is the 50-bp antisense DNAforcdc28. 

30 

DETAILED DESCRIPTION OF THE FIGURES 

FIG. 1 Biosynthetic pathway of msDNA synthesis. The retron region consisting of the msr-msd region and 
the gene for reverse transcriptase (RT) is shown on the top of the Figure. Solid arrows indicate the locations 
35 oftwo sets of inverted repeats (a1 anda2»andb1 and b2). Open arrows indicate the genes for msd RNA (msr) . 
msDNA (msd), and RT. The primary transcript is considered to encompass the upstream region of msr through 
the RT gene, which is shown by a thin line at step 1. The thick region in the RNA transcript corresponds to 
the final msd RNA. The branched G residue is circled, and the initiation oodon for RT is also shown. On the 
folded RNA, a triangle indicates the 5' end processing site at the mismatching base. The dotted lines at steps 
40 3 and 4 represent DNA strands. 

FIG. 2 (A) Structures of hybrid DNA-RNA msDNA arB shown as follows: Mx162, Mx85, Sal 63, Ec107. 
Ec67, Ec86 and Ec73. (Ann. Rev. Microbiol.. 45, 163-188 (1991)) (B) Structure of hybrid DNA-RNA msDNA- 
Ye117 is shown. The hatched zone represents the anti-sense DNA of cdc28.- 

FIG. 3 Arrangement of genes in the retron element responsible for the production of msDNA. Asingle-copy 
45 retroelement on the bacterial chromosome contains the region required forthe production of msDNA. All known 
msDNA coding regions contain three genes organized in a similar manner, as shown in (A): Agene, msd, codes 
for the DNA strand of msDNA. A second gene (msr) is situated, 5' to 3', in the opposite direction and codes 
forthe RNA strand of msDNA. A closely positioned ORF codes for the RT. Transcription of this region initiates 
at or near the 5' end of msr and extends beyond msd to include the ORF. A set of inverted repeat sequences, 
50 a1 and a2, is also conserved among msDNA coding regions (short arrows). The circled G corresponds to the 
residue in the RNA that wOl contain the 2', 5' branch linkage in msDNA(see also FIG. 8A). (B) Forthe E. coli 
retron Ec67, the region encoding msDNA is only a small part of a large element found on the chromosome 
(open bar). The junctk)n of the Ec67 retron with the host chronriosome is flanked by 26-base directly repeated 
chromosome sequences, as shown by arrows. The Figure is not drawn to scale. 
55 FIG. 4 Domain structures of various bacterial RTs. The regions with closed bars and with stippled bars 

represent the RT and RNase H domains, respectively. 

FIG. 5 Yeast expression vectors YEp51 (7. 3-kb) and YEp52 (6.6-kb), The structures of two yeast expres- 
sion vectors are diagrammed. Both are composed of sequences from the yeast plasmid 2-^m circle (smooth 
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single line) spanning REPS (eza) and the origin of replication ("). from the bacterial plasmid pBR322 (jagged 
single line) spanning the C0IEI origin of replication and the gene conferring anpicillin resistance, from the yeast 
genome spanning the gene LEU2 ( g,_3,), and from the region 5' to the yeast GAL1Q gene (□). extending from 

5 the SauSA site at -495 from the transcription-initiation site to the Sail site present in plasmid pNN78-A4 at 
+13. Acloned gene inserted in YEp51 in the Sail. Sall-to- Bam HI . Sall-to-Hindlll. or Salt-to-Bcll sites pointed 
labelled I in the Figure, terminating at a site in the 2-fim-circle sequences indicated by the blocked arrow (T). 
Similar transcription would be obtained with genes inserted in the Hindlll or Hindlll to Bdl sites of YEp52. Re- 
striction enzymes: R, EcoRI; H. Hindlll: B. BamHI; S, Sa^K P. Pstf ; Be, Bdl. See Broach et al. . Experimental 

10 Manipulation of Gene Expression, Academic Press Inc., New York, 1983. 

FIG. 6 Restriction map of the 1 1 .R-kb Eco RI fragment In the CI-1 E map, the left-sand half (EooRI to Hindlll) 
was not mapped. In the C11EP5 map. the locations and the orientations of msDNAand msdRNAare indicated 
by a small arrow and an open arrow, respectively. A large solid arrow represents an ORF and its orientation. 
See Lampson et aL, Science . 243, 1033-1038 (1989). 

15 FIG. 7 Diagrammatic representation of plasmid PC1-1BPv4. YEp521-M1,-M2,-M3.-M4 and 1^5. Diagrams 
show only the regions (shaded bars) inserted in the yeast vector. YEp521 . These regions contain retron-Ec67 
and restriction sites shown are only those which are used for the construction of plasmids. Short arrows with 
msr or msd are the locations and the orientations of msdRNA and msDNA. Long arrows with RT represent 
the gene for RT and its orientation. Thick arrows represent the GAL10 promoter and Its orientation of tran- 

20 scription. Lettere on top of bare are the sites of restriction enzymes: H, Hindlll; Ba, Ball; Pv, Pvull; B, Bam HI; 
and S, Smal . 

FIG. 8 A sequence polyacrylamide gel of the production of msDNA-Ec67 in S. cerevisiae . 

(A) Total RNA prepared from 0. 9ml of a iate-log culture was used for detecting msDNA with AMV-RT as 
described herein below. The RT reaction mixture was subjected to electrophoresis on a 6% sequence-urea- 

25 gel. An aliquot of the reaction mixture was treated with RNase A prior to gel electrophoresis. Lanes 1 and 
2 (G and C lanes, respectively) are DNA sequence laddere of pUC19 sequenced by chain termination 
method (Sanger etal., Proc. Natl. Acad. Sci. USA . 74. 5463-5467 (1977)) for size marks; tane 3, the AMV- 
RT products with total RNA from yeast ceils harboring YEp521-M1; lane 4. the same sample as lane 3 
except that it was treated with RNase A prior to gel electrophoresis; lane 5. the AMV-RT products with total 

30 RNA from yeast cell hartwring YEp521 . The sample was treated with RNase A. l-ane 6 is an Mspl digest 
of pBR322 labeled wits lr^^^]<\CTP wits the Klenow fragment of DNA polymerase 1 . Numbers at the right- 
sand side indicate fragment sizes in base paire and arrows with letters indicate positions of msDNA. 

(B) Schenrtatic representation of extension of the 3' end of msDNA-Ec67 by AMV-RT and RNase A treat- 
ment. 

35 FIG. 9 Southern blot hybridization of msDNA-Ec67 produced in S. cerevisiae. 

(A) Total RNAfiractions prepared from a 2. 5ml culture of yeast cells harboring YEp521-M1 (lane 1), and 
YEp521-M2 (lane 2) and from E. coli CL83 harboring pCL-1EP5c (lane 3) were used. After blotted to the 
nylon membrane filter, m8DNA-Ec67 was detected with the nick-translated 140-bp msr-msd DNAfragment 
as a probe. An arrowhead indicates the position of msDNA-Ec67. 

40 (B) Production of msDNA-Ec67 in S. cerevisiae harboring YEp521-M1. -M3, and-M4. Total RNA fractions 
prepared from a 2. 5ml culture of yeast cells hartDoring YEp521-M1 (lane 3).-M3 (lane 2). and-M4 (lane 1) 
were used for Southern blot hybridization as described in (A). An arrowhead indicates the position of 
msDNA- Ec67. 

FIG. 10 YEp521-M5 was constructed from YEp521-M4 by inserting into the msd region an Xhol site and 
45 into that site, cloning a 50-bp extraneous (foreign) dsDNA fragment which is complementary to mRNAof cdc28: 
step 1 . The Xhol site was added into the msd region of YEp521-M4 by PGR. This construct was then digested 
by Xhol; then the antisense DNA was ligated into the msd region of retron Ec67: step 2. This plasmid was trans- 
formed into yeast (SP-1) and the subsequently expressed msDNA designated herein as msDNA-Ye117. This 
is a novel structure. 

so 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS OF THE INVENTION 

In accordance with this invention, it has been discovered that a transfected yeast, Saccharomyces cere- 
visiae, produces a genetic structure, described as a synthesized, branched RNA-linked multicopy single- 
55 stranded DNA (msDNA). One such msDNA produced was msDNA-Ec67. msDNA-Ec67 was synthesized from 
retron- Ec67. 

The production of other representative msDNAs is described. 

Several msDNAs have been described in the literature. Some of these are the following: Mx162 (Dhundale 
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et aL, Cell, 51 1105-1112. 1987); Mx85 (Dhundale et aL, J. Biol. Chem 263. 9055-9058. 1988b); Sa163 (Fur- 
uichi et aL, CeH, 48, 47-52. 1987a and Furuichi et al., Ceil . 48, 55 62, 1987b); Ec67 (Lampson et al.. Science 
243, 1033-1038, 1989b); Ec86 (Linn and Maas. Cell, 56. 891-904. 1989); Ec73 (Sun et a!., J. Bacterio l 173. 
4171-4181. 1991); Ec107 (Herzer et aL. MoL Microbiol., submitted. August 1991); msDNAfrom E. coliB (Lim 
5 and Maas. CejL 56, 891-904. 1989). 

The msDNAs are often referred to in the literature by a numeral preceded by a suffix Indicating a host 
origin. For instance, "Mx" referring to Myxococcus xanthus. and " Ec". referring to E. coli and " Sa" to Stignna- 
telia aurantiaca. 

msDNAs are unique molecules which in spite of extensive diversity , share similar structural features. Gerh 
10 erically. msDNA may be described as being a nrK)lecule which comprises a branched RNA which is covalently 
linked to a single-stranded DNA by a 2'.5'-phosphodiester bond between the 2'-0H group of an internal rG 
residue and the 5'-phosphate of the DNA molecules and which RNA is non-covalently linked to the DNA by 
base pairing between the complementary 3' ends of the RNA and DNAnfK>!ecules. which RNA and DNA form 
the stable stem-loop secondary structures. The msDNA molecule is encoded by a primary RNA transcript, pre- 
15 msDNA. which contains an open-reading frame (ORF) downstream of the msr locus encoding a polypeptide 
which has sequence similarity with retroviral RTs and a highly conserved sequence common to RTs. 

The pre-msDNA way alternatively contain its ORF upstream of the msr locus in which event the retron 
will be of like construction. 

In FIG. 2 which shows typical msDNA, the RNA portion of the molecule is shown "boxed"; the balance of 
20 the structure being the ssDNA portnn. 

It will be noted that the molecules all show a branched rG residue, a DNA-RNA hybrid at the 3' ends of 
the nrtsDNAand msd RNA, and a stem-loop structure in the RNA and DNA strands. The branching ribonucleo- 
tide. G. is circled and the 2\5'-phosphodiester linkage to the first deoxynucleotide is indicated. 

Retrons: a retron is a small genetic element to date found to be of 1 .3 to 2.5-kb in length constituted of an 
25 msr-msd region and by the gene for encoding reverse transcriptase (RT). The coding region for msDNA is In- 
dicated by ' msd "; the coding regton for msdRNA is indicated by ' msr" . 

A comparison of all known msDNA coding regions reveal that this locus contains three genes organized 
in a similar manner (see FIG. 3). A gene called msd codes for t he DNA portion of msDNA. A second gene, msr , 
is situated 5' to 3', in the opposite orientation of msd. and codes for the RNA chain. Thus the genes msd and 
30 msr are convergently oriented so that their respective 3' ends overlap by several bases. 

This overiap is equivalent to the H-bonded DNA-RNA structure formed by the overlapping 3' ends of the 
RNA and DNA strands in the msDNA molecule. For Mx162. the overiapping msd - msr genes, like the hybrid 
structure of the msDNAthey produce, comprise 8 t>ase pairs. See Table I for typical overlap lengths of various 
msDNAs. 

35 Determination of the nucleotide sequence in the vicinity of the nrtsd-nrrsr genes revealed a dosely-l inked 
open reading frame (ORF). This ORF is located immediately upstream from msd. but is transcribed in the same 
direction as ir^ (as shown in FIG. 7). The Initiation codon of the ORF is situated as dose as 19 basepairs 
from the start of the msd gene for the Ec86 retron of E. coli B. but as much as 77 base-pairs for the Mx162 
retron of M, xanthus. 

40 Another conserved feature of the chromosomal locus that codes for msDNA is a set of inverted repeat 
sequences, designated a1 and a2. Sequence al is located just upstream from the start of the msd gene, while 
sequence a2 is positioned immediately 5' to the G residue in the msr gene that forms the 2'. 5' branch linkage 
in the msDNA molecule (FIG. 4). The inverted repeats display a large degree of nudeotide sequence diversity 
among the different known loci encoding msDNA. as well as differences in size. For example, the inverted 

45 repeats (al and a2) found in the retron locus encoding Mx162 are 34 nudeotides long, while the inverted re- 
peats for the Ec86 retron of E. coli B are only 12 bases in size. Despite their diversity, these repeat sequences 
are located in the same positions (as shown in FIG. 3) for all known loci encoding msDNA. As discussed in 
more detail below, the position of these inverted repeat sequences is critical to the synthesis of msDNA. 
It will be helpful to refer to the above discussion when the aspects of the invention are dscussed which 

50 provide for an inversion in the organization (position inversion) of the RT gene with respect to the msr-msd 
coding region, and in the discussion of shortening the non-coding region between the transcriptional initiation 
site and the initiation codon AUG of the RT gene. 

The promoter for the msr-msd region is upstream of msr. Transcription is from left to right, encompassing 
the entire region induding the RT gene. As described in further detail hereinafter, the replicating vehide for 

55 transfecting the eucaryote host may harbor one promoter for the msr-msd and the RT. or it may contain two 
promoters, one for the msr-msd region and the other for the RT. 

It is within the scope of the invention that retrons be constructed to yield an msDNA which differs from 
the typical msDNAs by features other than the common, conserved and characteristic features of msDNAs 
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described above. For instance, it is not excluded that the length and/or location of the set of IRs a1 and a2 in 
the retions be varied providing they remain in the same location discussed above. Thus, the size and/or lo- 
cation of the loops in the stems of the msDNAcan be varied. 

Further, it is not excluded that the extent of the overlap of the base pairing of the 3' ends of DNA and 
5 RNA in the nr^DNAs be influenced (increased or decreased) by appropriate manipulations. Whether such va- 
riations will be desirable will depend on the ultimate utility proposed for these msDNAs. 

General Features of msDNAs. Table I 6 a summary of the structure of representative retrons. 

Reverse Transcriptase (RT). The domain structures of bacterial RTs of representative retrons are shown 
in FIG. 4. 

10 The RT gene is normally located downstream from the msr-msd region. In the new retrons which differ 
from the bacterial retrons, their relative positions are reversed, the msr - msd region is located downstream of 
the RT gene. 

The biosynthesis of msDNAs has been described by Inouye & Inouye, Ann. Rev. Microbiol. . 45, 163-186 
(1991b) and Herzer et aL, Mol. Microbiol., submitted, August 1991. A schematic of the synthesis is shown in 

15 FIG. 1. A primary transcript (pre-msdRNA) is considered to encompass the upstream region of msr through 
the RTgene, which is by reference to FIG. 1 , shown by a thin line at step 1 . The thick region in the RNA transcript 
corresponds to the final msdR NA. The branched G residue is circled, and the initiation codon for RT \s also 
shown. On the folded RNA, a triangle indicates the 5' end processing site at the mismatching baserThe dotted 
lines at steps 3 and 4 represent DNA strands. 

20 In summary, the primary transcript from the msr-msd region is k>elieved to serve not only as a template 
but also as a primer to produce the msDNA. Synthesis of msDNA is primed from an internal rG residue of the 
RNA transcript using Its 2'-0H group. Thus, msDNA is branched out from this rG residue by a 2'-5'-phospho- 
diester linkage. 

There will be described hereinafter the transformation of yeast cells harboring plasmkis which contain a 
25 retron (which includes the RT gene) for expression of the desired msDNAs. The description is of a best mode 
to date to express msDNA- Ec67 from its retron, Ec67. 

1. Synthesis of msDNA-Ec67. For the expression of msDNA-Ec67, plasmid YEp52 was used. Plasmid 
YEp521 was constructed by introducing the multiple cloning sites of pUC19 (Yanisch-Perron et aL, Gene , 33, 
103-119, 1985) into YEp52, which was designed to obtain high-level, inducible expression of a cloned gene 

30 under the GAL10 pronrK)ter in yeast YEp52 contains the C0IEI origin of replication (OR), a promoter of the 
GALIO gene. LEU2, the 2^-circle origin of replication, and the 2-\i circle REP3 locus (SEE FIG. 5; See Broach 
et aL, Experimental Manipulation of Gene Expression, Academic Press Inc., New York, 1983). 

Retron-Ec67 was prepared from plasmid pCL-1 BPv4 in which the 4-kb Ball-Pvull fragment (DNA from frag- 
ment from the Ba[l to 2nd Pvul l site from the left end of the map depicted in FIG. 5 was cloned into the Hind i 

35 site of pUC9. E. ooH harboring this plasmid produces msDNA-Ec67 (Lampson et aL, Science 243, 1 033-1 038, 
1989). 

A total RNA fraction was prepared from cell transfected with pCI-1Ep5c ; pC1-1EP5c contains the 5-kb 
Pstl(a)- Eco RI fragment encompassing the entire 4-kb Ball -Pvu ll sequence in PC1-1BPv4. See FIG. 7. 

The construction of plasmid YEp521 proceeded as follows. The DNA fragment containing the pUCI 9 mul- 
40 tiple cloning sites were Isolated by digestion of pUC19 with EooRI, the cleaved ends were filled in with the 
Klenow fragment of DNA polymerase 1, and then digested with Hindlll. The resulting 54-bp fragment was 
cloned into YEp52 by replacing a fragment between the Bel l (filled in with the Klenow fragment) and Hindlll 
sites, resulting in YEp521. 

YEp521, thus constructed, contains the multiple cloning sites from pUG19, except for EcoRI, downstream 
45 of t he GAL10 promoter. 

The 4-kb Hindlll- Bam HI fragment from pC1-1BPv4 (see FIG. 7) was cloned into the Hindlll and Bam HI 
sites ofYEp521. 

As a result, the msr-msd region and the RT gene of retron-Ec67 were placed downstream of the GAL10 
promoter. This plasmid is designated YEp521-M1 . 
50 Plasmi YEp521-M1 is illustrated in FIG. 7. The shade bars are the regions inserted in yeast vector, YEp521 . 
It will be noted that the RT of retron-Ec67 gene is located behind (downstream) the msr-msd region. 

2. Production of n^DNA in transformed yeast. A yeast strain (SP1 : a ura3 , leu2,trp1 , hls3,ade8,canr , gal2) 
was used. Transformation of the yeast ceils was carried out by the lithium acetate method of (Ito et aL, J. Bac- 
teriol, 153, 163-168, 1983). Yeast culturing was carried out as described below. msDNA was produced and 

65 was detected by extending the 3' end of msDNA by avian myeloblastosis virus reverse transcriptase (AMV- 
RT). This yielded a main product of 117 nucleotides. Treatment of this product with ribonuclease A resulted in 
a DNA of 105 nucleotides. These results are in good agreement with the sfructure of msDNA-Ec67 (see Lamp- 
son etaL,Sdence 243, 1033-1038, 1989). The production of msDNA-Ec67 was further conformed by Southern 
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blot hybridization. 

To determine whether the production of msDNA-Ec67 could occur in yeast without the RT genes from re- 
tain Ec67, the following work was performed. 

An RNA preparation from cells harboring YEp521 -M2 only containing the msr-msd region under the GAL10 

5 promoter, and described herein Bbove (see FIG. 7) was analyzed by Southern blot hybridization. As shown in 
lane 2, FIG. 9, no band corresponding to msDNA-Ec67 was detected, indicating that the RT gene from retron- 
Eo67 is essential for the msDNA synthesis in yeast cells. 

In a simOarmanner, CHO cells can be transformed using known strategies and techniques. The same could 
be done with HeLa cells or other vertebrate mammalian celts. 

10 Gene Rearrangement in Retrons. An Important finding in connection with the invention is that the yield 
of msDNA in transfected yeast cells is significantly improved by means which cause, it is believed, an increase 
in production of RT. One such strategy is to reduce by as many as possible the numbers of the AUG codons 
between the transcriptional initiation site of the GAL1Q promoter (or any other promoter used for that purpose) 
and the initiation codon of the RT gene. Best results were obtained when a portion of the 5' end non-coding 

IS region containing initiation translation codons AUG is deleted, but for the first AUG codon in closest proximity 
to the 5' end. 

Thus, it was found that a significant portion of the 5' end of the non-coding region was not essential to 
production of msDNA in yeast cells. Deletion of a portion of the nucleotide sequence containing the AUG co- 
dons significantly improved the yield of msDNA production. 

20 Specifically in YEp521-M1 (see FIG. 7), there are 417 -bp from the 5' end Hindlli site to the initiation codon 
GAAfor RT. (See FIG. 7 in Lampson et al.. Science 243, 103S-1038. 1989). Adeletion of the 240-bp sequence 
upstream of the msr gene finom the left hand most Hindlli site to immediately upstream of msr of YEp521-M1 
(see FIG. 7) was carried out 

For this purpose, the fragment of 140-bp msr-msd (including 5 extra bases upstream of msd and 18 exb^ 

25 bases at the 3' end of the msr-msd region upstream of msd) (after PGR amplification) and the 1 .8-kb RT gene 
(including 6 bases upstream) of the initiation codon of the RT gene (also after PGR amplification) (and 4 bases 
downstream of the termination codon), were cloned into the Hindlli and Bam HI sites of YEp521-M1 yielding 
YEp521-M3 (see FIG. 7). The yield of m3DNA-Ec67 in transfected yeast with YEp521-M3 was shown to be 
significantly increased, as discussed below. 

30 Another importantf inding made to substantially increase the yield of msDNAs in yeast is to transpose the 
positbn of the RT gene with respect to the msr-msd region. In bacterial retrons, the msr-msd region is In front 
of the RT gene; when the RT was moved upstream of the msr-msd region, a further increase in yield of msDNA 
was observed. This was accomplished as follows. 

Since the msr-msd region of YEp521-M3 still contains 3 AUG codons, YEp521-M4 (see FIG. 7) was con- 

35 structed, in which the order of the RT gene and the msr-msd region was reversed, i.e., the msr-msd region 
being positioned after the RT gene. In YEp521-M4, there is only one AUG codon between the left hand-most 
Hindlli site and the BamH I site (see FIG. 7), which exists in the multiple cloning sites of PUC1 9 (Yabisch-Perron 
et aL, Gene. 33, 103-119. 1985). This AUG codon » terminated by a termination codon, UAG after 5 codons. 
The initiation codon, GAA, for the RT gene was placed 6 codons after the termination codon in the same read- 

40 ing frame. 

YEp-M5 was then constructed from YEp521-M4 by adding 50-bp antisense DNAfor cdc28 into the region 
coding for msd (described in further detail below). Therefore, two plasmids were constructed in which the order 
of the RT gene and the msr-msd region was reversed. 

The yield of msDNA-Ec67 in yeast cells transfected with YEp521-M4 was compared with that of yeast cells 
45 transfected with YEp521-M3. YEp521-M4 brought about a further increse (about 8 fold) of yield over YEp521- 
M3. 

It has been reported that a ribosomal subunit (carrying Met-trNA"^ and various initiation factors) binds 
initially at the 5' end of mRNA and then scans through the mRNA stopping and then initiates translation at the 
first AUG codon in a favorable context (Kozak, J. Cell Biology 108. 229-241 , 1 989). From a recent survey of 

60 699 vertebrate mRNAs, GCCGCCACC AUG emerges as the consensus sequence for initiation of translation 
in higher eucaryotes (Boeke et al^ Cell, 40, 491-500, 1985). The survey reports the study of the 5' non-coding 
sequences of the 699 vertebrates mRNAs (all sequences to which access could be had in the literature). The 
mRNAsource of the vertebrates included human (muscle, skeletal, liver, intestinal, etc.), bovine, rat and others. 
Also in yeasts, AUG was reported to be the consensus sequence for initiation of translation. (Hamilton et a[, 

55 Nucl. Acids Res.. 15, 3581-3583, 1987) It is noted that Kozak, J. Cell Biology 108. 229241, 1989 also reported 
variations and exceptions to the rmre general rule described above. For instance, there are reported cases 
where initiation is not restricted to the first AUG codon, which therefore is not used exclusively, but includes 
other AUG codons in the vicinity of the 5' end. Further, inactivating the first AUG codon closest to the 5' end, 



7 




EP 0 532 380 A2 

followed ribosomes to initiate translation at another codon (DUG). 

As described herein above, the location of the Initiation codon of the ORF for various msDNAs can vary 
(e.g., 1 9-bp from the start of the msd gene for Ec86 retron and 77-bp for the Mxl 62 retron). Thus, one skilled 
in the art can adjust the length of the excised non-coding region of the retron when the above strategy is fok 
5 lowed. 

The finding in connection with the invention described above, namely, that a significant improvement in 
yield of n^DNAs takes place when AUG codons between the transcriptional site of the GAL1Q promoter and 
the initiation codon of the RTgene are deleted, but for the one AUG codon closest to the 5' end which is pre- 
served, is therefore consistent with the above-discussed literature reports. Accordingly, this finding made in 

10 accordance with the invention with respect to the production of msDNAs is not intended to be limited to yeast, 
but can reasonably be predicted to apply to other msDNA-producIng transfec^ed eucaryotes, in particular high- 
er eucaryotes like mammalian cells, e^ HeLa cells. CHO, COS-1 cells and others. 

The same observation can be made regarding the position of the RTgene upstream of the msr-msd region. 
This finding too is believed to have general applicability to the production of msDNAs in eucaryotes, as noted 

15 above. It is believed that these described strategies may contribute to an increase in RT and ultimately in yield 
of msDNAs. 

It will be apparent to one skilled in the art that the two strategies described (deletion of AUG codons and 
inversion of the respective positions of the RT gene and the msr-msd region, do not have to be performed 
together (as shown with respect to YEp521-M4), which is a best mode to date. For instance, the strategy may 
20 be performed without the deletion strategy, and vice-versa. Further, as noted above, any strategy which will 
contribute to the increase of the production of RT, is considered within the scope of the invention. 

The msDNAs which are synthesized from these new retrons are also new. 

As has been noted herein, it is not necessary that one promoterfor the RT gene and the msr-msd region 
be used. More than one can be used, one for the RT and one for the msr-msd region. When it is desired to 
25 use two promoters, either one or both of the strategies to inaease RT production namely the inversion and/or 
deletion strategy can also be used, as will be apparent to one skilled in the art 

It is noteworthy that the DNA sequences, which contain these unique retrons (due to the deletions and/or 
posit bn inversion) and which encode the new msDNAs, are new when compared to known bacterial retrons. 
So are the replicating vehicles carrying these retrons and the transfected eucaryotes hartx)ring these vehicles. 
30 They provide effective means to produce new single-stranded DNA in eucaryotes in improved yields. 

It is to be noted also that the two above-described strategies which have been discussed with respect to 
eucaryotes are applicable to msDNAs produced from modified retrons in procaryotes. 

The invention has been illustrated with an illustrative retron. Ec67. However, by a similar procedure, yeast 
can be made to produce other msDNAs. For instance, in a similar manner, retron Ec73 can be used to transform 
35 yeast strain SP1 to produce msDNA-Ec73. 

Likewise, a similar procedure can be followed to transform and produce msDNA-Mx65 in yeast from the 
necessary retron elements. See Dhundale et al, J BO , 263, 9055-9058, 1988. Its ORF codes for 427 amino 
acid residues. 

If it is desired to produce nDsDNA-Mx162 in yeast, the appropriate DNA fragment containing retron Mx162 
40 can be prepared from a 17.5-kb Sa[l fragment which is disclosed in Yee et aL, Cell , 38, 203-209, 1 984. Its ORF 
codes for 485 amino acid residues. 

For the expression of msDNA-Ec1 07. a similar strategy may be followed. The retron is a 1 .3-kb DNA frag- 
ment of which the 34-bp intergenic sequence between pyrE and ttk (in FIG. 4) Is deleted. The retron contains 
an ORF coding for 319 amino acid residues (from base 396 to 1352 in FIG. 2). The reference to Figures made 
45 hereinabove is to Dhundale et al^ Cell , 51, 1105, 1987. This retron is the smallest yet found in bacteria. 

The retron for Sal 63 was determined to be contained in a 480-bp DNAfiragment encompassing the msd 
and msr regions (Furuichi et aL). 

The refron for Ec73 was determined to be contained in a 3.5-kb sall{b)-EcoRI{c) fragment (see FIG. 1A 
of Sun et aL). For details on Ec73, see below. 
50 The retron for Ec86 was determined to be contained in a 3.5-kb PstI fragment (Lim and Maas. Cell , 56. 
891-904. 1989). 

Likewise, from retrons Sa163, Ec86 and Ec73. the corresponding msDNAs, msDNA-Sa163, msDNA-Ec86 
and msDNA-Ec73 may be produced in transfected yeast celts. If plant or mammalian vertebrate cells are used, 
appropriate manipulations and strategies will be followed. 
55 Similar techniques may be followed to express other msDNAs known or yet to be found or to be synthesized 

from theirrespective retrons. All of these retrons are expected to contain the elements necessary to synthesize 
the unique features of msDNAs, as is described herein. 

Thus, in general retrons containing the essential features described herein are useful to produce in vivo 
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in eucaryotes the stable (not degraded) msDNAs having the conserved and characteristic features described 
herein. 

msDNA-Ec73 is synthesized from retron Ec73 which is described by Sun et ah, Journal of Bacteriology . 
173, 4171-4181,1991. 

5 FIG. 2 therein shows the nucleotide sequence capable of synthesizing msDNA-Ec73, a 3.5-kb S(b)-E(c) 

fragment. It was determined that the first ATG codon at position 11,544 is the initiation for the necessary RT 
gene and the ORF for the RT is of 316 residues. 

It is to be noted that in all retrons known to date, the RT gene is located at 20 to 77-bp upstream of the 
msd gene (downstream of the msd gene). 

10 In all retrons studied to date, it is believed that the promoter elements serve as the promoters for both 
msdRNA synthesis and the ORF. For Instance, for msDISIA-Ec67, pronrwter elements in a -10 region TTGACA 
and In a -35 region TGAAT, are believed to fulfill this function (see Lampson et ah. Science. 243 . 1033-1038, 
1989) . However, in accordance with the invention, it is not essential that there be one promoter element for 
both components, but rather two promoter elements, one for initiating RNA polymerase transcription for the 

15 RT gene and the other for the msr-msd region. Thus the msr-msd regbn and the RT gene can be expressed 
under two independent promoters, which would be likely to complement each other. However, it appears at 
this time that at least for two of the msDNAs described herein (msDNA-Ec67 and msDNA-Ec73), the produc- 
~ tion of msDNA-Ec67 can only be complemented by the RT-Ec67 and not by the RT-Ec73 or vice-versa. 
Further, it is often desired to use a strong promoter rather than the native promoter. 

20 Another important emtx>diment of the invention relates to the in vivo production in eucaryotes of any DNA 
firagment(s), non-native or foreign, to the msDNA structure. Likewise, the vectors and the transfected eucary- 
otic hosts, carrying such foreign DNA fragment(s) are encompassed by the invention. The invention thus 
makes possible the synthesis in vivo in eucaryotes of stable msDNAs which encompass a foreign DNA frag- 
ment in the DNA portion or a foreign RNA fragment in the DNA portion of the DNA-RNA hybrid structure. Of 

25 particular interest are msDNAs which include a single-strand or DNA or RNA fragment which is complementary 
to the mRNA of a particular target gene (or fragment) thereof (antisense DNA or RNA). 

In one example of this embodiment, there was constructed a plasmid, YEp5 21-M5, into which there was 
inserted in the nrtsd region, nucleotides 299-426 of YEp521-M4 (the boxed region of the lower strand of FIG. 
7 of lampson et aL, Science. 243. 1033-1038, 1989), a Xho l restriction recognition site (TiCTAG) ; a foreign 

30 DNAfragment of 50-bp was inserted in this Xhol site. YEp521-M5 was transformed into yeast (SP-lj and the 
subsequently expressed msDNA designated herein as msDNA-Ye117 (see FIG. 2B). 

In a like manner, they may be inserted into the msr regbn of YEp521-M4 a restriction recognition site, or 
a DNAfragment. This retron may be transformed into yeast (SP-1), and the subsequently expressed msDNA 
is a new structure. It corresponds to the structure designated here as msDNA-Ye117, except that the newfbr- 

35 eign fragment is in the RNA portion of the msDIslA. 

The invention makes possible the construction of a system that may be used to regulate the production 
of genes. The modified nrisDNAs of the invention contain in the DNA portion, a done DNA fragment from a 
gene downstream of a pronnoter in the orientation promoting the production of antisense DNA or RNA(micR- 
NAs). The "micRNA" terminology has been applied to an RNA transcript which is an i]}RNA-intefering- 

40 complementary RNA (Coleman et aK, Cell, 37, 429^36 (1984) and literature references cited therein). "micR- 
NA' has been reported to inhibit the production of certain proteins (e.g. , OmpF). A similar regulation has been 
reported for a micRNA and the gene for the TnIO transposase gene. The gene for the micRNA and for the 
transposase are reported to be transcribed in opposite directions of the same segment of DNA, such that the 
5' ends of their transcripts can form a complementary hybrid. The hybrid is thought to inhibit the translation 

45 of the transposase mRNA. Coleman etaL, supra., report the construction of an artificial "mic" system designed 
to regulate the expression of any specific gene in E. coli. 

Various cell division cycle (cdc) genes are known; by now some 50 different cdc genes have been defined 
in terms of landmark events occurring during duplication of cellular molecules (e.g. . glycolic events). Various 
cdcgenes and their functions are described in Watson etaL, Molecular Biology of the Gene . Fourth Ed. (1987), 

50 Chapter 18. Amongst these are cdo4 required for initiation of DNA synthesis in the mitotic cell division cycle 
and other functions; cdc7 of similar functk)n to cdc4 but for premeiotic DNA synthesis; cdc28 necessary for 
duplication of the spindle pole body is homologous to mammalian protein kinases and has protein kinase ac- 
tivity, and others like cdcB, cdc9 and others. 

The strategy to produce a msDNA containing a foreign dsDNA fragment in its DNA portion is depicted in 

55 FIG. 10. The DNAfragment is shown (dark bar). The 50 bp nucleotide fragment has the following sequence 
(SEQ ID No. : 1): 
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^' TCGATGTAATrEGCTAATTCACCGCTCATGT^^ ^' 

5« 

ACAOTAAACGATTAACTGGCGACTACAAGCirrcCTATCAAfiATJ^ 

5 

Yeast cells (SP-1) transfected with YEp521-M5 produced a new msDNA-like structure, msDNA-Ye117, 
shown in FIG. 2B (analyzed by polyacrylamide-urea gel electrophoresis). This new ntsDNA construct contains 
the SQ-bp DNA fragment It Is contemplated that the new msDNA is a useful vector for antteense DNA. The 
new construct is expected to produce a single-stranded DNA which is complementary to a specific mRNA, in 
10 this instance, that of cdc28 and inhibit the expression of that mRNA, and of the gene. 

Antisense DNA (micDNA) and micRNAs which are complementary to regions of the mRNA known to in- 
teract with ribosomes, would be of particular interest. Hence, such nr^DNAs that contain such DNA-micDNA 
generating regions are of special interest for various applications. Thus, by inserting an appropriate DNA frag- 
ment of a gene after a promoter, e.g., into an Xho l site, one can construct with the msDNAs disclosed herein 
15 (and others) a system to specifically regulate the expression of any gene. 

This is the first time that such antisense system has been provided from a molecule produced In an eu- 
caryote. 

It is contemplated that other DNA fragments be inserted in the msd region and/or the msr region of the 
plasmid here disclosed and the corresponding new msDNAs synthesized which nnay have similar functions, 
20 e.g., to generate a micDNA or a micRNA complementary to a mRNA to inhibit its gene. 

Likewise, YEp521-M1 can be modified, producing an enlarged new msDNAstruc^re (on the 5' end of the 
DNA portion of msDNA). 

When it is desired to insert a DNA sequence encoding a protein (polyeptide) e.g., two copies of a gene, 
the DNA sequence will be inserted in opposite orientation to another at a selected restriction site into the msd 
25 sequence of an msDNAof choice, such as YEp521-M4. There is expected to be produced in an eucaryotic 
host, a novel msDNA-RNA structure. When the lacZ gene is incorporated into a suitable location in the msd 
region of the selected construct, it is expected that p-galactosidase activity will be detected. 

EXAMPLES 

30 

The follo\A/ing Examples are offered by way of illustration and ara not intended to limit the invention in any 
manner. In these Examples, all percentages are by weight for solids and by volume for liquids, and all temper- 
atures are in degrees Celsius unless otherwise noted. 

For convenience and clarity, the Examples refer to and provide also a detailed description of the Figures. 

35 

Example 1 

Yeast Strains, Media and Growth Condition 

40 Yeast SP1 strain (a uraS Ieu2 trp1 hisS adeS can£gal2) was used. Cells were grown in YPD medium (1% 
yeast extract, 2% bactopeptone, and 2% glucose). For screening transformants of YEp52 and its derivatives, 
a minimal nrtedium was used (Rose et aL, Methods in Yeast Genetics: A Laboratory to Course Manual . Cold 
Spring Harbor Lab., Cold Spring Harbor, NY, 1990), supplemented with all nutrients required but leucine. For 
galactose Induction, 0. 15ml of the pre-cultured cells in the minimal medium containing 2% galactose instead 

45 of glucose wera utilized. The cells were grown at 30° until late-log phase. Yeast transformations were earned 
out by the lithium acetate method (6). Transformatton of yeast cell was confirmed as follows: the plasmid pre- 
pared from yeast transformants was transformed into E. cgli DH-5 (F"endA1 recAl hsdRI 7 (rk", k^) supE4 4 
thi-1 . gyrA9 6, relA D and the plasmid prepared from DH-5 cells was not yet subsequently characterized. Plas- 
mid DNA from yeast cells was prepared according to the method described by Hoffman and Winston, Gene . 

50 57,268-272,1987. 

Plasmids : YEp52 (broach et aL, Experimental Manipulation of Gene Expression . Academic Press Inc., 
New York, 1983) was used to construct plasmids for expressbn of msDNA in yeast This plasmid contains the 
ColEI origin of replication, a promoter of the GAL10 gene, LEU2. the 2^-circle origin of replication, and the 
2n-circle REP3 locus. Retron-Ec67 was prepared from plasmid pC1-1 BPv4 in which the 4-kb Bal l-Pvull firag- 
55 ment (DNA fragment from the BA1I to the 2nd Pvu1 l site from the left end of the map described in FIG. 5 of 
Lampson et aL, Science 243, 1033-1038, 1989), was cloned into the Hindi site of pUC9. E. coH harboring this 
plasmid produces msDNA-Ec67. A total RNA fraction was prepared from ceils transformed with pC1-1EP5c. 
pCL-1EP5c contains the 5-kb Pstl(a)- Eco RI fragment encompassing the entire 4-kb Ba1-l- Pvu ll sequence in 
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pC1-1BPv4 (see FIG. 5 of Lampson et aL, Science. 243, 1033-1038. 1989) in pUC9. 

Ptasmid Construction: plasmid YEp521 was constructed by introducing the multiple cloning sites of pUCI 9 
(Yanisch-Perron et al.. Gene, 33, 103-119, 1985) into YEp52 (Broach etaL, Experimental Manipulation of Gene 
Expression, Academic Press Inc., New York, 1983), which was designed to obtain high-level, inducible expres- 

5 sion of a cloned gene under the GAL10 promoter in yeast The DNA fragment containing the pUC19 nrtultiple 
cloning site was isolated by digestion of pUC19 with Eco RI; the cleaved ends were filled in with the Klenow 
fragment of DNA polymerase 1, and then digested with Hindlll. The resulting 54-bp fragment was cloned into 
YEp52 by replacing a fragment between the Bel I (filled in with the Klenow fragment) and Hindlll sites, resulting 
In YEp521 . YEp521 , thus constructed, contains the multiple cloning sites from pUCI 9, except for EogRI, down- 

10 stream of the GAL10 promoter. The 4-kb HindlH- Bam HI fragment from pC1-1BPv4 was doned into the Hindlll 
and Bam HI sites of YEp521 . As a result, the msr-msd region and the RT gene of retron-Ec67 were placed down- 
stream of the GAL10 promoter This plasmid is designated YEp521-M1 as shown in FIG. 7. 

In order to eliminate a fragment of 242 bases upstream of msr which contains several ATG codons, poly- 
merase chain reaction (PGR) was performed using YEp521-M1 as a template with two synthetic oligonudeo- 

15 tides, M2-a (S'GCAAGCTTCATAAACACGCATGP': SEQ ID No.: 2) and l\^2-b (S'CTGGATCCAGAAACGCATG- 
CAGG^: SEQ ID No. : 3) as primere. 

These sequences correspond to the sequences firom base 243 to 258 of retron Ec67 for M2-a and from base 
384 to 369 for M2-b (see FIG. 7 of Watson et al. which flank the msr-msd region. The 140-bp PGR product 
was gel-purified and digested with Hindlll and BamH I. The resulting fragment was cloned into the Hind III and 
20 Bam HI sites of YEp521 . yielding YEp521-M2. Yep521-M2 contains only the msr-msd region under the GAL10 
promoter. 

To insert the RT gene at the Bam HI site of YEp521-M2, the 1.8-kb Bam HI fragment encompassing the 
RT gene was aniplif led by PGR using YEp521-M1 as a template and two oligonudeotides, M3-a (^'CTGGATC- 
CAAGAAATGACAAAAACA3': SEQ ID No.: 4) and M3-b(s^CTGGA TCCTT CATTAGCTATTTAAC AP': SEQ ID 
25 No.: 5) as primers which correspond to base 409 to 429 and from base 2182 to 2163 of retron-Ec67 (see FIG. 
7 of Lampson etaly Science, 243, 1033-1038, 1989), respectively. The 1.8-kb fragment was gel-purified, di- 
gested with BamH I. and closed into the Bam HI site of YEp521-M2. The resulting plasmid was designated 
YEp521-M3. 

YEp521-M4 was constructed to change the order of the msr-msd region and the RT gene. The msr-msd 
30 region was amplified by PGR using M2-a and M2-b (see above) except that Sma l sites were added at their 5' 
ends. The 1.8-kb Bam HI fragment containing the RTgene was cloned into the Bam HI site of YEp521. Subse- 
quently, the 140-bp Smal fragment containing the msr-msd region was doned into the Smal site of the above 
plasmid and the resulting plasmid was designated YEp521-M4. 

YEp521-M5 was constructed from YEp521-M4 to add the 50-bp antisense DNA for cdc28 (Xho l fragment) 
35 into the msd region. The Xhol site was added into the msd region of YEp521-M4 by PGR. This construct was 
then digested by Xho l; then the antisense DNA was ligated to the msd region of retron Ec67. This plasmid 
was transformed into yeast (SP-1) and the subsequently expressed msDNA designated herein as msDNA- 
Yell 7. 

Detection of msDNA : a total RfslA fraction from yeast cells was prepared as described by Elder etaL, Proc. 

40 Natl. Acad. Sci. USA. 80, 2342-2348, 1 983 and a total RNA fraction from E. coli was prepared from E. odi har- 
boring pCI-1EP5c by the method described by Chomzynski et ah, Anal. Btochem, 162. 156-159, 1987. 

To label msDNAwith reverse transcriptase, the total RNAfiraction prepared from 0.9ml of a late-log culture 
was added to 20 ^1 of a reaction mixhjre containing 30mM Tris-HCI (pH 8.2), 50mM KGI, 10mM MgGl2, 5mM 
DTT, 0. 2mM each of dTTp, dGTP, dCTP, 5 \xC\ of [y-32p]dATP and 5 units of avian myeloblastos^ virus reverse 

45 transcriptase (AMV-RT; Molecular Genetic Resources). The reaction mixture was incubated at 30''C for 1 hour, 
and an aliquot of the reaction mixture was subjected to electrophoresis with a 6% polyacrylamide -8M urea 
gel. Another aliquot was treated with RNase A(10 for 10 minutes at 37''C and subjected to electrophor- 
esis. 

msDNA-Ec67 was also detected by Southern blot analysis (Southern, Mol. Biol.. 98, 503-517, 1975). Total 
50 . RNAfiPom 2.5ml of a late-log culture was applied to a 1.5% agarose gel with E buffer [40mM Tris HGI (pH 8.0), 
1 0mM sodium acetate, 2mM EDTA). After electrophoresis, the gel was blotted to a nylon membrane filter (PALL 
BLODYNE A TRANSFER MEMBRANE; IGN) by the capillary transfer method. Hybridization was carried out 
in 50% (v/v) formamide, 5x SSPEfl x SSPE; 180mM NaCI. 10mM sodium phosphate (pH 7. 4), 1mM EDTA 1, 
0. 3% sodium dodecyl sulfate, and 5 x Denhardf s solution (Denhardt, Biochem. Biophvs.Res. Commun. .23. 
55 641-646 (1966)) with the nick-translated 140-bp msr-msd region as a probe (Rigby et aL, J. Mol. Biol. . 113, 
237-251. 1977). 

As noted above, the invention provides for the expression of the desired msDNAsfrom eucaryotes in gen- 
eral. While the invention has been illustrated with a yeast of the genus Saccharomyces . others are readily suit- 
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able to practice the invention. 

Aconvenient source of suitable yeasts is found in the ATCC Catalogue of Yeasts. 18th Ed., 1990. Because 
of practical and economic importance, the invention is partiojlarly directed to the genus Saccharomyces which 
b extensively used in baking, beer, wine and other Industries. Conventionally these yeasts are referred to as 
5 baker's, brewer's and wine yeasts. 

Amongst these, of special interest are the S.oerevisiae strains, the S. bayanus , S. carlsbergenensis, S. 
diataticus, and S. uvarum. which lend themselves to transformation with the vectors of the invention. Further, 
to express the msDNAs, one may use vertebrate host cells like COS-1, CHO and HeLa cells or Invertebrate 
cells or plant cells. 

10 Plants that may be used include monocotyledons and dicotyledons. Illustrative examples of plants which 
may be transformed are the following: alfalfa, soybeans, nnaize and wheat (Genetic Engineering of Plants, An 
Agricultural Perspective, Edited by Kosuge et aL, Plenum Press (1983)). 

To carry out the present invention, various cloning vectors may be used to transfect compatible eucaryotic 
host celts for replication of the vector. Thereafterthe transformants are identified, plasmid DNA prepared there- 

15 from, and the msDNAs extracted and purified. 

Vectors for expression of cloned genes in yeasts are described in Methods In Enzymology, Vol. 1 94, " Gui- 
de to Yeast Genetics and Molecular Bk>logy', page 373 (Guthrie and Fink, Eds., Academic Press Inc.^ , 1 991). 
It will be apparent from one skilled in the art to select an appropriate promoterfor expressing msDNAs in yeast 
with or without a foreign DNA fragment, such as from regulatable promoters of the GAL family, e.g. , GAL4, 

20 GAL80, GALI. GAL2, GAL7. GAL10. GALII. MEL1.ADH1 and PGK; also see Broach et aL. Experimental Ma- 
nipulation of Gene Expression, Academic Press, Inc., New York, 1983 or non-regulatable strong promoters. 

Oligonucleotide synthesis may be carried out by a number of methods including those disclosed in U.S. 
Pat No. 4, 415, 734, and in Matteuci et al., J. Am. Chem. Soc. , 103 (11):31B5-3191 (1981). Adanris et al,. J, 
Chem. Soc, 105 (3): 661-663 (1983). and Bemcage et aL, Tetrahedron Letters , 22 (20): 1859-1867 (1981). 

25 For the expression of msDNAs in higher eucaryotes with or without selected DNA fragment, one skilled 
in the art may refer to and use known techniques. The advantages of synthesizing particular eucaryotic pro- 
teins in eucaryotes are well known. Depending on the msDNA which is intended to be produced, an appropriate 
eucaryote host cell will be seleded (see Molecular Cloning: A Laboratory Manual , Second Edition, §3, §16.3 
and seg . (Sambrook et §[., Gold Spring Hartx)r Laboratory Press, 1989)). The eucaryotic expression vehicle 

30 will contain, as is known, a promoter and enhancer elements, recognition sequences, the TATA box and up- 
stream promoter elements. Other conventional elements located upstream of the transcription initiation site 
for replication and selection are known and described in standard laboratory manuals. Vectors are avaOable 
commercially, for Instance firom Pharmacia (pMSG, pSVT17, pMT2). For methods for introducing recombinant 
vectors into mammalian cells, see Molecular Cloning: A Laboratory Manual , Second Edition, §16. 30-16. 55, 

35 (Sambrook et al., Cold Spring Harbor Laboratory Press, 1989). For cosmid vectors for transfection of mam- 
malian cells, see Molecular (Zoning: A Laboratory Manual, Second Edition, §23.18 and seg. . (Sambrook etaL, 
Cold Spring Harbor Laboratory Press, 1989). 

Further, one skilled in the art may wish to refer to Current Protocols In Molecular Biology . Volume 1, §16. 
12-16. 13.7 (Ausubel et ah , Eds. , Greene Publishing Associates and Wiley-lnterscience, 1989), discussing 

40 in particular, three vector systems or strategies for introducing foreign genes into mammalian cells with COS 
cells, CHC and vaccinia viral vectors. One skilled in the art will select the most appropriate system for the pro- 
duction of msDNAs from the selected retrons, and Further, for introduction of DNA into mammalian cells (see 
Current Protocols In Molecular Biology. Volume 1, §9. 01-9.93 (Ausubel et al., Eds., Greene Publishing Asso- 
ciates and Wiley-lnterscience, 1989)). 

45 The msDNAs have several Interesting utilities. 

A fascinating utflity that is being considered Is the role that msDNAs of the invention can play on the for- 
mation of triple helix DNA, or triplex DNA with a specific duplex on the chromosome. Arecent report in Science , 
252. 1374-1375 (June 27. 1991), 'Triplex DNA Finally Comes of Age", highlights the timeliness of the present 
invention. Triplex DNA can be formed by binding a third strand to specific recognized sites on chromosomal 

50 DNA. Synthetic strands of sizes preferably containing the full complement of bases (such as 11-15 and higher), 
are discussed. The msDNAs of the invention with long 3'(or 5') ends (and the loop of non-duplexed bases) 
would appear to be excellent candidates. These regions provide single-stranded DNA necessary for the triplex 
formation. The resulting triplex DNA is expected to have increased stability and usefulness. New therapies 
based on the triple helix formation, including in AIDS therapy and selective gene inhibition and others are pro- 

55 posed in the Report. 

Artificial, synthetic msDNAs can be designed and may be used as antisense DNAs, and/or RNAs and/or 
ribozymes using the single-stranded DNAor RNA region of msDNAs. Such msDNA(oontaining a foreign ssDNA 
or ssRNA fragment) for use as antisense system, been described above. The production of an nrisDNA with 
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complementarity with a gene (or portion) thereof, blocks the synthesis of the specific protein itself. The msDNA 
system produced in eucaryotic cells to generate a desired complementary DMA of an mRNAof a gene, appears 
to have real potential in eucaryotic cells to block the expression of various harmful or toxic genes, such as 
drug resistance, oncogenes, and phages or viruses. The system could have applications to AIDS therapy. Of 

5 special interest, are the msDNAs that would be produced by HeLa cells and containing such selected DNA 
fragment for use In antisense applications. 

As described above, it is contemplated that genes be inserted for instance, in the the stem region (s) of 
the msDNAs. Thus the msDNAs may be used for amplificatk>n of the selects gene. 

The polymerase chain reaction (PGR) is a well-known rapid procedure for in vitro enzymatic amplification 

10 of a specific segment of DNA. The standard PGR method requires a sequent of double-stranded DNA to be 
amplified, and always two-singie stranded oligonucleotide primers flanking the sequent, a DNA polymerase, 
appropriate deoxyribonudeoskle triphosphate (dNTPs), a buffer, and salts (Currant Protocols . Sectbn 15). 

Thus, the msDNAs due to their unique structure (and stabOity). are expected to be of value in numerous 
applications in the biochembal, medical, pharmaceutical and other biological sciences. 

15 It can be seen that the present invention is providing a significant contribution to arts and science. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: University of Medicine and Dentistry of 
New Jersey 

(ii) TITLE OF INVENTION: Method for synthesizing stable 

single-stranded cDNA in eukaryotes by means of a bacterial 
retron, products ans uses therefor. 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cabinet Beau de Lomenie 

(B) STREET: 55. Rue d' Amsterdam 

(C) CITY: Paris 

(E) COUNTRY: FRANCE 

(F) ZIP: 75 008 

(v) COMPUTER READABLE FORM: 

{A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYST05: PC-DOS/MS-DOS 

(D) SOFTWARE; Paten tin Release #1,0. Version #1.25 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/753.110 

(B) FILING DATE: 30-AUG-1991 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: GILLARD, Marie-Louise 

(C) REFERENCE/DOCKEn' NUMBER: MLG.MHC/J16983-4 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: ^2 80 6** 68 

(B) TELEFAX: ^8 ^^ 37 60 

(C) TELEX: 650476F 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5U base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
TCGATGTAAT TKXJTAATTC ACCGCTCATG TTCGAAGGAT AGTTCTATTT CATC 
(2) INFORMATION FX}R SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
GCAAGCrrCA TAAACACGCA TOT 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOrmETICAL: YES 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CTGGATCCAG AAACGCATGC AGG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
^ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:i*: 

15 CTGGATCCAA GAAATGACAA AAACA 25 

(2) INFORMATION FOR SEQ ID NO: 5: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTGGATCCTT CATTAGCTAT TTAACAT 27 



Claims 



1. Amet hod for producing a stable msDNA in a eucaryotic host cell transfected with a DNA expression vector 
capable of replication in the eucaryotic host ceil, which vector contains a retron capable of ms DNA syn- 
thesis in the eucaryotic host cell. 

45 2. The method of claim 1 wherein the expression vector is a plasmid. 

3. The method of daim 2 wherein the eucaryote host cell is yeast. 

4. The method of daim 3 wherein the yeast is of the genus Saccharomyces . 

^ 5. The method of daim 1 wherein the eucaryote host cell is a plant or mammalian cell. 

6. The method of claim 1 wherein the retron for msDN A contains the msr-msd region and the gene encoding 
for RT. 

55 7. The method of claim 3 wherein in the retron, the RT gene is downstream of the msr-msd region. 

8. The method of claim 3 wherein in the retron, a non-coding DNA fragment of the 5' end upstream of msr 
which contains several ATG codons. has been deleted but for the initiation codon AUG of the RT gene. 
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9. The method of daim 8 wherein in the retron, the RT gene is upstream of the msr-msd region. 

10. The method of daim 3 wherein the expression vector contains a promoter for a retron consisting of msr- 
msd region and an RT gene, which promoter is located upstream of msr . 

11. The method of daim 10 wherein the pronfK)ter is a foreign strong promoter. 

12. The method of daim 11 wherein the strong promoter is GAL10 . 

13. The nnethod of daim 2 wherein in the retron in the plasmid, the msd sequence contains a doning site. 

14. The method of daim 9 wherein a foreign DNA fragment is contained in the msd region of the retron which 
codes the DNA of the ms DMA. 

15. The method of claim 1 wherein the retron is in a vector selected from the group consisting of YEp521- 
M1. YEp521-IVI3. YEp521-IVI4 and YEp521-M5. 

16. The method of daim 4 wherein the msDNA moleojle comprises a branched single-stranded RNA which 
is cpvalently linked to a singte-stranded DNA by a 2'. 5'-phosphodiester bond between the 2'-0H group " 
and an internal rG residue and the 5'-phosphate of the DNA molecule, which RNA is non-covalently linked 
to the DNA by base-pairing between the complementary 3' ends of the RNA and DNA molecules, which 
RNA-DNAfbrm stable stem-loop secondary structures, which msDNA is encoded by a primary RNA tran- 
script pre-nnsDNA which contains an open-reading frame (ORF) downstream of the msr locus, the ORF 
encoding a polypeptide which has sequence simOarity with retroviral RTs and a highly conserved se- 
quence common to all RTs. 

17. The method of daim 1 6 wherein the stable msDNA which is produced is selected from the group consisting 
of msDNA-Ec67. msDNA-Mx162, msDNA-Mx65. msDNA-Ec107. nrteDNA-Mx86 and Sa163. 

18. The mthod of daim 4 wherein the ORF is located upstream of the m^ locus. 

19. The method of daim 18 wherein the stable msDNA which is produced is msDNA-Ec67. 

20. A DNA expression vector capable of replication in an eucaryotic host which comprises a retron which is 
capable of encoding a stable hybrkd single-stranded RNA-DNA(msDNA) structure, which retron contains 
a gene encoding a reverse transcriptase (RT) and one coding region which contains two coding sequenc- 
es, one " msr^ and the other " msd" for encoding, respectively, the RNA and DNA portions of the msDNA. 

21. The DNA expression vector of claim 20 wherein in the retron, the RT gene is downstream of the msr-msd 
region. 

22. The vector of daim 21 wherein in the retron a non-coding fragment of the 5' end upstream of msr which 
contains several ATG codons has been deleted but for the initiation codon AUG of the RT gene. 

23. The vector of claim 22 wherein in the retron, the RT gene is upstream of the msr-msd region. 

24. The vector of daim 20 which indudes a promoter for the msr-msd region and the RT gene, which promoter 
is located upstream of msr. 

25. The vector of daim 24 wherein the promoter is a foreign strong promoter 

26. The vector of claim 25 wherein the promoter is GAL1 0 . 

27. The vector of claim 23 in which a foreign DNA fragment is contained in the msd regton of the retron which 
encodes the DNA of the msDNA. 

26. The vector of daim 27 wherein the fragment is 50-bp long. 

29. The vector of daim 20 which Is YEp521-M1, YEp521-M3. YEp521-M4 or YEp521-M5. 

30. A eucaryotic host cell transfected with a DNA expression vector capable of replication, which vector con- 
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tains a retron capable of msDNA synthesis. 

31. The eucaryotic host cell of claim 30 which is selected from the group consisting of yeast, plant cells and 
mammalian cells. 

32. The eucaryotic host cell of claim 31 which is yeast. 

33. The eucaryotic host cell of claim 32 wherein the yeast is of the genus Saccharomyces . 

34. The eucaryotic host cell of claim 30 wherein the retron contains the msr-msd region and the gene en- 
coding for RT. 

35. The eucaryotic host cell of da im 34 wherein in the retron, the RT gene is downstream of the msr-msd 
regbn. 

15 36. The eucaryotic host cell of claim 41 wherein in the retron a non-coding fragment of the 5' end upstream 
of msr which contains several ATG codons, has been deleted but for the initiation oodon AUG of the RT 
gene closest to the 5' end. 
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37. The eucaryotic host cell of claim 36 wherein in the retron, the RT gene is upstream of the msr-msd region. 

38. The eucaryotic host cell of claim 31 wherein the expression vector contains the promoter for the msr-msd 
region and RT gene, which promoter is located upstream of msr. 

3d. The eucaryotic host cell of claim 38 wherein the promoter is a foreign strong promoter 

40. The eucaryotic host cell of claim 39 wherein the strong promoter is GAL10 . 

41. The eucaryotic host cell of claim 37 which contains a retron having an msd region which contains a foreign 
DNA fragment 

30 42. The eucaryotic host cell of claim 41 wherein the DNA sequence is 50-bp long. 

43. A new msDNA molecule constituted of a single-stranded RNA and a single-stranded DNA portion which 
molecule comprises a branched single-stranded RNA which is covalently linked to a single-stranded DNA 
by a 2', 5'-phosphodlester bond between the 2 -OH group and an internal rG residue and the 5'-phosphate 

35 of the DNA molecule, which RNA is non-covalently linked to the DNA by base-pairing between the com- 

plementary 3'ends of the RNA and DNA molecules, which RNA-DNA form stable stem-loop secondary 
structures, which ms DNA Is encoded by a primary RfslA transcript, pre-msDNA which contains an open- 
reading frame (ORF) upstream of the msr locus, the ORF encoding a polypeptide which has sequence 
similarity with retroviral RTs and a highly conserved sequence common to all known RTs and which 

40 msDNA is encoded by an msr-msd region, the msDNA containing a foreign DNA fragment in its 5'end 

strand of the DNA in the region which was encoded by the msd region of the retron which encoded the 
DNA of the msDNA. 
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44. The new msDNA of claim 43 wherein the msDNA is encoded by either plasmid YEp521-M4 or plasmid 
YEp521-M5. 

45. The new msDNA of claim 43 which contains in its DNA portbn which terminates by the 5'end a foreign 
DNA fragment which is capable of generating an RNA transcript which is complementary to a specific 
mRNA of a target gene or fragment thereof. 

46. The new msDNA of daim 45 wherein the complementary sequence Is (SEQ ID N"*: 1) : 



^* TCGATGTAATTIXKTAATTCACCGCrCATGTTCGAAGGATAGrrT^ ^' 
55 ACATTAAAOGATTAAGlXSGCGAGTACAAGCrrCVrA TCAAGA 

47. The new msDNA of daim 46 wherein the generated RNA is complementary to cdc28. 
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